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Please amend the paragraph beginning on page 2, line 31, and Continuing to page 
3, line 3, as follows: 



It has become apparent from the above that the conventional automatic 
recognition system 100 depicted in Fig. 1 analyzes the input speech signal 
range up to 4 kHz by sampling the analog input speech signal at 8 kHz 
sampling rates may be used as well. For example, personal computers 
sampling rate of 1 1 kHz which represents 1/4 of the 44t3t44.1 kHz used 
of CDs. It is evident that a higher sampling bandwidth is connected wiih 
information so that the performance of automatic speech recognition S] 
increases if higher sampling rates are employed. 



speech 
in a spectral 
Of course, higher 
often use a 
for the sampling 
more spectral 
generally 



systems 



Please amend the paragraph beginning on page 3, line 21, and 
3, line 26, as follows: 



continuing to page 



The speech analysis in this network system is based on a MEL 
subbands. The number of 23 MEL subbands is kept constant for all thr 
This means that the subbands are differently distributed over each of 
ranges of 4, ^§5.5 and 8 kHz (corresponding to the sampling rates of 
to be analyzed. 

Please amend the caption on page 4, line 9, as follows: 
BRIEF SUMMARY OF THE INVENTIO N 



Mterbank with 23 
se sampling rates, 
the three spectral 
11 and 16 kHz) 
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Please amend the paragraph beginning on page 4, line 11, and continuing to page 
4, line 18, as. follows: 

According to th e inv e ntion, aA speech analyzing stage of an automatic speech 
recognition system proposed for analvzin g analyzes a speech signal sampled at one of 
at least two different system sampling rates in the spectral domain^*. T le speech 
analyzing stage compra s ing - comprises a first spectral analyzer for analyzing the speech 
signal up to a first frequency and a second spectral analyzer for analyzing the speech 
signal at least above the first frequency. 

Please amend the paragraph beginning on page 4 f line 20, and ^continuing to page 
4, line 26, as follows : 



A method a eeording - to the invention for analyzing in the spectnil 
signal sampled at one of at least two different system sampling rates of 
speech recognition system comprises a first analysis step for analyzing 
up to a first frequency and a second analysis step for analyzing the 
above the first frequency. 



Please amend the paragraph beginning on page 4, line 28, and 
4, line 37, as follows: 



domain a speech 
an automatic 
the speech signal 
speech signal at least 



continuing to page 



provided, each 



According to the invention, a At least two spectral analyzers are ] 
spectral analyzer analyzing the speech signal in the spectral domain. Tie first spectral 
analyzer analyzes the speech signal in a lower spectral range having an upper frequency 
limit which is defined by a first frequency. The first frequency is prefe rably derived from 
the lowest system sampling rate. The lowest system sampling rate is the lowest sampling 
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rate occurring in an automatic speech recognition system in which at lejast two different 
sampling rates are utilized. 



Please amend the paragraph beginning on page 5, line 13, and 
5, line 21, as follows: 
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< :ontinuing to page 



The first spectral analyzer according to the inv e ntion ensures a Irigh compatibility 
among the components of an automatic speech recognizing system working with several 
system sampling rates since for all sampling rates a compatible set of acoustic parameters 
can be obtained. This compatible set of acoustic parameters is generated by the first 
spectral analyzer which independently from the sampling rate, i.e. even for the lowest 
sampling rate, always parametrisizes.an identical spectral range up to t le first frequency. 



Please amend the paragraph beginning on page 5, line 35, and 
6, line 2, as follows: 



continuing to page 



A further advantage of the invention is the fact that the recognition 
automatic speech recognition system can be simplified because the cor ipatible 
acoustic parameters allows to perform pattern matching using one and 
matching unit for acoustic parameters sampled at different sampling rates 



stage of an 
set of 
the same pattern 



Please amend the paragraph beginning on page 9, line 19, and 
9, line 28, as follows: 



continuing to page 



stage. 



The invention t echnology also relates to a data signal to be 
terminal to the network server having a central speech recognition 
comprising a first data structure relating to the sampling rate and a 
containing at least one codebook index derived from a codebook for a 
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combination of one or more acoustic parameters obtained by analyzing 
up to a first frequency and one or more further acoustic parameters 
the speech signal at least above the first frequency. 



Please amend the paragraph beginning on page 10, line 4, and 
10, line II, as follows: 
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the speech signal 
obtained by analyzing 



continuing to page 



The invention t echnology c an be implemented for example as a 
and as a computer program product comprising program code portions 
individual steps of the invention when the computer program product i 
automatic speech recognition system. The computer program product 
computer readable recording medium like a data carrier attached to or 
system component. 



hardware solution 
for performing the 
i$ run on an 
ijaay be stored on a 
removable from a 



Please amend the paragraphs beginning on page 10, line 28, ar\d continuing to 
page 11, line 12, as follows: 



Fig. 3 



is a block diagram of an automatic speech recognition 
comprising a first embodiment of a speech analyzing 
to the invention ; 



Fig. 4 



is a block diagram of a distributed speech recognition system 
according to the invention ; 



5- 



system 
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Fig. 5 



is a block diagram of a second embodiment of a 
stage according to the invention which may be 
systems of Fig. 4; 



speech 



used 



Fig. 6a to 6c 



Fig. 7 



are block diagrarams showing the use of codebookjs in speech 
analyzing stage s according to the invention ; and 



is a schematic diagram of a data signal according tp the inv e ntion to 
be used in the DSR system of Fig. 4. 



Please amend the paragraph beginning on page 12, line 24, ancf continuing to 
page 12, line 33, as follows: 



The problems encountered with the automatic speech recognitiojn 
in Fig. 2 are overcome by the automatic speech recognition system 10 
and comprising a first embodiment of a speech analyzing stage 12- oeefoding 
inv e ntkm . The automatic speech recognition system 10 further comprises 



stage 14 with a single pattern matching unit 16. The pattern matching 
pattern matching based on the acoustic parameters received from the s 
stage 12 and based on reference models which are stored in a database 
Fig. 3. 



analyzing 
in the DSR 



system depicted 
depicted in Fig. 3 
to - the 



a recognition 
unit 16 performs 
peech analyzing 
not depicted in 



Please amend the paragraph beginning on page 13, line 22, anq, continuing to 
page 13, line 33, as follows: 



The first spectral analyzer 18a arranged in the first speech anal^jzing 
the speech analyzing stage 12 is configured to analyze the speech signal 
range up to fewest- This upper spectral boundary of the spectral range 
spectral analyzer 18a was derived from the lowest system sampling 



:rate 



-6 



branch 12a of 
ill a spectral 
analyzed by the first 
2 xf lQWC5t by 
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18a 



multiplying the lowest system sampling rate 2 x fewest The 
boundary of the spectral range analyzed by the first spectral analyzer 
chosen such that it equals less than half the lowest system sampling rati 
if the lowest system sampling rate is 8 kHz, fewest could equal 3t§3J> kHz 



Please amend the paragraph beginning on page 16, line 4, and continuing to page 
16, line 8, as follows: 



invention is 



fe-Fig. 4 - depicts an example DSR system 200 according to the 
depicted . The DSR system 200 comprises a single network server 210 
recognition stage 210a. The network server 210 communicates via wirid 
communication links 212 with three terminals 214, 216, 218, e.g. mobile 



with a central 
or wireless 
telephones. 



Please amend the paragraph beginning on page 17, line 14, anq continuing to 
page 17, line 20, as follows: 



As can be seen from Fig. 5, the second spectral analyzer 18b 
range between 4 and 5^5.5 kHz and outputs M (M > 1) additional 
relating to the speech energy in this spectral range. The third spectral 
analyzes the spectral range between 5.5 kHz and 8 kHz and outputs 
additional acoustic parameters relating to the speech energy in this 



spectral] 



Please amend the paragraph beginning on page 18, line 19, anq continuing to 
page 18, line 27, as follows: 



The further terminal 216 operated at a sampling rate of 1 1 kHz 
constructed using a speech analyzation stage 216a with only the first 
analyzing branches 12a, 12b depicted in Fig. 5 since a speech signal 
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spectral 
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anidyzes 



the spectral 
acoustic parameters 
ajialyzer 18c 
N(Ni 1) 
range. 



could be 
speech 
salrapled at 1 1 kHz 
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does not comprise spectral information above kHz. For the same reason the speech 
analyzation stage 214a of the terminal 214 operated at a sampling rate of 8 kHz would 
only require a single speech analyzing branch similar to the speech analyzing branch 12a 
depicted in Fig. 5. 



Please amend the paragraph beginning on page 21, line 12, anc } 
page 21, line 20, as follows; 



In Fig. 6b, determination of the codebook index for the acoustic 
c 2 obtained at a sampling rate of 1 1 kHz within the terminal 216 is 
codebook 28b deviates from the codebook 28a depicted in Fig. 6a in 
28b comprises a further column E^s^ relating to the speech energy 
speech signal in a frequency range between 4 kHz and 5,5 kHz, 
is determined by a speech analyzing branch simitar to the 
depicted in Fig. 5. 



Please amend the paragraph beginning on page 21, line 33, and continuing to 
page 22, line 3, as follows: 

In Fig* 6c, determination of the codebook index for the acoustic parameters c lf c 2 » 
E*^^ and E^s^s obtained within the terminal 218 at a sampling rate of 16 kHz is 
illustrated. The codebook 28c depicted in Fig. 6c deviates from the coc.ebook 28b 
depicted in Fig. 6b in that it comprises a further column for the acoustic parameter E^ss- 
8 . The codebook index for the quadruplet of acoustic parameters Cj, C2> B^sss and E&^-s 
is determined in a similar manner as described above with reference to Fig. 6b. 



continuing to 



parameters c x and 
illustrated. The 

the codebook 
con|tained within the 
value of E^s^ 
speech analysing branch 12b 
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Please amend the paragraph beginning on page 23, line 4, and Continuing to page '• 
23, line 27, as follows: 



transmitted 



In Fig. 7, the overall data structure of a data signal 50 to be 
of the terminals 214, 216, 218 to the network server 210 is depicted. Tljie 
comprises a first data structure 52 which contains synchronization inf ohnation 
header information relating to the sampling rate at which the corresponding 
has been sampled. A second data structure 54 of the data signal 50 contains 
data substructures 56 to 68. The first data substructure 56 relates to the 
energy which was linearly quantized with eight bits. The six remaining 
relate to the codebook indices for the six pairs of acoustic parameters ci 
table). The second data substructure 58 contains the seven-bit codeboojc 
generated as explained above with reference to Figs. 6a to 6c. This means 
codebook index contained in the second data substructure 58 was derived 
codebooks 28a, 28b, 28c for a specific combination of the first two at rustic 
Ci, C2 and the one or two further acoustic parameters E^^, E^^. 8 w! 
by analyzing the speech signal in upper frequency ranges- The five further 
substructures 60 to 68 depicted in Fig, 7 relate to the five further pairs 
parameters depicted in the above table. Altogether, the seven data subs|tructures 
contain 44 bits of information. 



Please amend the paragraph beginning on page 24, line 23, an§ continuing to 
page 26, line 2, as follows; 



From the table it can be seen that for HMMs trained at a sampl 
high recognition accuracy (word error rates smaller than -jkO?l X)7 %) 
independently of the sampling rate of the speech signal. Moreover, a 
when moving from the lowest sampling to higher sampling rate and 



from one 
data signal 50 
and 
speech signal 
a plurality of 
logarithmic frame 
data structures 
to c 12 (see above 
index which was 
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from one of the 
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rich were obtained 

data 
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ihg rate of 16 kHz a 
c|an be expected 
glain can be seen 
applying HMMs 
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trained at the respective sampling rate. No further gain can be achieved when moving 
from 1 1 kHz to 16 kHz. This is not surprising because there exists only little spectral 
information in speech signals above $$5.5 kHz. From the table it also secomes clear that 
independently from the sampling rate of a terminal a high recognition performance can 
be obtained for all HMMs. 
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